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\ RING CAPACITY EFFICIENT ARCHITECTURE 

BACKGROUND OF THE INVENTION 

1. Field of Invention 

This invention relates to methods and systems for providing fault-tolerant ring 
5 structures. 

2. Description of Related Art 

Synchronous optical network (SONET) rings have been used for many years as a 
survivable transport architecture to support high-reliability services. A SONET ring is a 
closed loop of nodes supporting a protocol that allows fast rerouting in the event of 
10 failures. Currently, SONET rings are provisioned with each node having service line 
interfaces and protection line interfaces in each direction. SONET requires service and 
protection resources to be fully provisioned to properly function. Accordingly, new 
technology is needed for providing fault tolerance with less resource requirements. 

SUMMARY OF THE INVENTION 
1 5 The present invention provides methods and systems for provisioning fault- 

tolerant ring structures that require less than fully provisioning of all spans of the ring 
with service capacity. 

In various exemplary embodiments, a plurality of nodes can be connected in a 
ring structure. The ring structure can allow a ring to be provisioned without a continuous 
20 loop of service capacity around all legs of the ring. This postpones or totally avoids the 
cost of service capacity until a demand requires it. Only protection capacity is required to 
close the loop, and the decision to add working capacity can be made as warranted. At 
the same time, this ring structure can retain its fault-tolerant nature and also provide 
acceptable performance. 
25 BRIEF DESCRIPTION OF THE DRAWINGS 

The invention is described in detail with reference to the following figures, where 
like numerals reference like elements, and wherein: 

Fig. 1 is an exemplary embodiment of a fully provisioned SONET ring network; 
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Figs. 2-3 illustrate various fault situations for the exemplary fully provisioned 
SONET ring shown in Fig. 1; 

Fig. 4 is an exemplary embodiment of a partially provisioned SONET ring 
network; 

Figs. 5-6 illustrate various fault situations for the exemplary partially 
provisioned SONET ring shown in Fig. 4; 

Fig. 7 is block diagram of an exemplary node; and 

Fig. 8 depicts a flowchart outlining an exemplary technique for providing linkage 
restoration. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

SONET ring networks have been used for many years as a survivable transport 
architecture to support high-reliability (fault-tolerant) services. By providing a fault- 
tolerant protocol that allows for service capacity to be provisioned as needed, transport 
costs associated with provisioning a SONET network can be considerably reduced. 

Fig. 1 shows an exemplary embodiment of a fully provisioned SONET ring 
network 100. The fully provisioned SONET ring network 100 includes a plurality of 
nodes connected in a ring structure. As shown in Fig. 1, the exemplary ring structure 
includes nodes 1 10-160. Although Fig. 1 shows an exemplary ring structure with a 
particular number of nodes, it should be appreciated that the ring structure can include 
any number nodes consistent with existing ring standards. 

As shown in Fig. 1, each node is connected to adjacent nodes by 
bi-directional interfaces. Service links 111-161 carry service traffic in a bi-directional 
manner and are provisioned across every span of the SONET ring. In addition, the 
protection links 1 16-166 carry protection traffic in a bi-directional manner and are 
provisioned across every span of the SONET ring. Thus, there are two closed loops 
around the ring of service capacity and protection capacity. 

The nodes 1 10-160 can be any known or later developed mechanism for receiving 
and transmitting information over their respective links 1 1 1-166. The nodes 1 10-160 can 
be any one of a number of different types of nodes, such as SONET terminals, digital 
cross-connect systems (DCS), add/drop multiplexers (ADM's), asynchronous transfer 
mode (ATM) switches, satellites, computers, routers, or any combination of software and 
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hardware capable of generating, relaying, recalling from storage any information capable 
of being transmitted to the network 100 without departing from the spirit and scope of the 
present invention. However, for the examples below, the nodes 1 10-160 are add/drop 
multiplexers (ADM's). 

The links 1 1 1-166 can be any known or later developed devices or systems for 
transmitting data between the plurality of nodes 1 10-160. Such devices include fiber- 
optic cable connections and dense wave division multiplexer (DWDM) connections. 
In general, the links 1 1 1-166 can be any known or later developed connected systems, 
computer programs or structures usable to connect the nodes 1 10-160 in a ring structure. 

Communication paths within the SONET ring structure 100 can occasionally fail 
Upon such failures, the SONET ring network 100 can quickly self-repair by establishing 
new internal communication paths. The restoration process starts as a node 1 10-160 
detects a fault using fault detection devices situated at the node or elsewhere. Upon 
detection of a problem, the SONET ring network 100 reroutes communication according 
to a predefined protocol. Under this protocol, if the fault is detected on a service link and 
the corresponding protection link is available, then communication is diverted from the 
service link to the protection link, thereby restoring communication. However, if there is 
a complete span failure somewhere within the path, the traffic can be looped back onto 
the protection ring by the node detecting the problem, and traverse an alternate path in the 
opposite direction. 

Fig. 2 shows an exemplary embodiment of a fully provisioned SONET ring 
network 100 experiencing a service fault. As shown in Fig. 2, a service fault 125 exists 
between nodes 120 and 130. This fault condition 125 prevents service traffic from being 
reliably transmitted between nodes 120 and 130. However, by rerouting the service 
traffic between nodes 120 and 130 to the protection link 126, the network can self-repair. 

For example, it might be desirable to transmit a message from node 1 10 to node 
140 of this network 100. In normal operation, the message might be added to traffic at 
node 1 10, transmitted along the service line 1 12 to node 120, then along the service line 
122 to node 130, and finally along the service line 132 to node 140. The message could 
thereupon be dropped at node 140. However, in this example, where a service fault 125 
exists between nodes 120 and 130, the traffic on service link 1 1 1 must be switched to 
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protection link 126 and this message can travel along protection line 127 to avoid the 
fault 125. In this manner, this message can be transmitted reliably between nodes 1 10 
and 140 even though there is a service fault. 

Fig. 3 shows the exemplary embodiment of a fully provisioned SONET ring 
network 100 experiencing a complete span failure (e.g., a fiber cut, a node failure). As 
shown in Fig. 3, a complete span failure exists between node 120 and node 130. When a 
complete span failure is detected, service traffic is diverted to the protection ring and then 
looped back. 

As discussed above, a message can be added to traffic at a particular node and 
then dropped at the destination node. For example, if a message is to be transmitted from 
node 1 10 to node 140 of the network, the message can be added to traffic at node 110 
and, for example, traverse the path from node 1 10 along service line 1 12 to node 120, 
then from node 120 along service line 122 to node 130, and finally from node 130 along 
service line 132 to node 140. However, if there is a complete span failure somewhere 
within this path, the traffic can be looped back onto a protection link by the node 
detecting the problem, and traverse an alternate path in the opposite direction. 

For example, assume a message is to be transmitted from node 1 10 to node 140 
but that a complete span failure exists between node 120 and node 130. In this example, 
the traffic can be looped back at node 120 (where the fault is detected), and redirected to 
the protection link 1 16 and thereafter traverse the alternate path from node 120 along 
protection line 1 18 to node 1 10, then from node 110 along protection line 168 to node 
1 60, from node 160 along protection line 158 to node 150, and finally from node 150 
along protection line 148 to node 140. In this manner, the ring structure can provide the 
necessary linkage restoration to ensure reliability and the message can arrive at the 
desired destination node. 

Fig. 4 shows an exemplary partially provisioned SONET ring network 200. The 
exemplary partially provisioned SONET ring network 200 includes a plurality of nodes 
connected in a ring structure. As shown in Fig. 4, the exemplary ring structure includes 
nodes 210-260. The exemplary partially provisioned SONET ring network 200 allows a 
ring to be provisioned without a continuous loop of service capacity around all legs of the 
ring. This allows for the postponement of providing service capacity until a demand 
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requires it. Only protection capacity is required to close the loop, and the decision to add 
working capacity can be decided on a link-by-link basis. Moreover, the ring can continue 
to meet the same performance requirements as before. 

In general, a message can be transmitted along any path for which service has 
been provisioned. Thus, as shown in Fig. 4, service traffic may be transmitted from node 
210 to node 230 and from node 240 to node 260 because service links 21 1, 221, 241 and 
251 exist along these paths. However, since the path between node 230 and 240 is not 
provisioned with service capacity, service traffic cannot be transmitted from node 230 to 
node 240 (in either direction) unless, of course, the protection fibers are used for data 
traffic that may be pre-empted in case of a fault. 

The exemplary partially provisioned SONET (PPSONET) ring network 200 
requires several changes to the conventional approach. In particular, a new ring capacity 
management capability is required to provision the protection channels only, and to add 
or delete service channels as needed on any specific span of the ring. In general, as far as 
the ring-switching protocols are concerned, the unprovisioned service channels can act as 
if they are in a normal state, without any signal fail or signal degrade conditions. Thus, 
standing alarms are not generated for spans that have only protection capacity. Traffic 
can be routed on the provisioned service channels and extra traffic for pre-emptable 
demands can still be routed on the protection channels even when there is no 
corresponding service channel. Additionally, on spans that do not have the service 
channel provisioned, the data communication channel (DCC) can be routed on the 
protection channel. Furthermore, messages sent along a path to verify the integrity of the 
path (e.g., k-byte messages) will be sent on the protection links. 

Fig. 5 shows the exemplary PPSONET ring network 200 experiencing a service 
fault 225. As shown in Fig. 5, the fault condition 225 prevents traffic from being reliably 
transmitted in the easterly direction between nodes 220 and 230. However, by rerouting 
the service traffic to the protection link 226 to avoid fault 225, the network can self- 
repair. The restoration process is similar to the conventional approach taken by the 
exemplary fully provisioned SONET ring network 100. 

Traffic can be rerouted from a service link to a protection link when a fault 
condition is detected. Normally, for example, if a message is to be transmitted from node 
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210 to node 230, the message can be added to traffic at node 210, transmitted along 
service line 212 to node 220, then from node 220 along service line 222 to node 230, 
where the message can be dropped. However, if at node 220 a service fault 225 is 
detected, traffic on service link 21 1 can be switched to protection link 226 and this 
message can travel along protection line 227 to avoid the fault 225. In this manner, a 
PPSONET ring network can provide the same level of tolerance for service faults as 
before. 

Fig. 6 shows an exemplary PPSONET ring network 200 experiencing a complete 
span failure. As shown in Fig. 6, there is a complete span failure between nodes 220 and 
230 of this network 200. However, even in this catastrophic situation, the PPSONET ring 
network 200 can provide the necessary linkage restoration. 

In operation, a message can be transmitted along any path of the exemplary 
PPSONET ring network 200 for which service has been provisioned. Thus, as shown in 
Fig. 6, service traffic can be transmitted from node 210 to node 230 and from node 240 to 
node 260. However, since the path between nodes 230 and 240 is not provisioned, 
service traffic cannot be transmitted from node 230 to node 240. 

If it is desirable to transmit a message from node 21 0 to node 230, for example, 
the message normally can be added at node 210 and transmitted to node 230 where it can 
be dropped. However, if there is a complete span failure between node 220 and node 
230, then traffic can be rerouted by node 220 (where the fault is detected) to the 
protection link 216 and looped back. In this manner, a message that is sent from node 
210 to node 230 can be transmitted to node 230 even though there is a complete span 
failure. 

In this example, the message can be looped back at node 220 and travel from node 
220 along protection line 218 to node 210, from node 210 along protection line 268 to 
node 260, from node 260 along protection line 258 to node 250, and finally from node 
250 along protection line 248 to node 240. As can be seen, so long as all legs of the ring 
are provisioned with protection capacity, the loopback routine can be used. Thus, it is not 
necessary to fully provision the ring with service capacity in order to enjoy the benefits of 
a high-reliability ring structure so long as all legs of the ring are provisioned with 
protection capacity. 
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Fig. 7 is a block diagram of an exemplary node 700. The exemplary node 700 has 
a controller 710, a memory 720, a fabric 730, a fault detection unit 740, a local interface 
750, a capacity table 760, a western service interface 770, a western protection interface 
775, an eastern service interface 780, and an eastern protection interface 785. The 
controller 710 is linked to the other devices 720 to 785 by the data/control bus 715. 
While Fig. 7 shows a bus architecture, any type of architecture may be used and 
implemented using any type of technology such as application specific integrated circuits 
(ASIC), PLAs, PLDs, etc. as is well known to one of ordinary skill in the art. 

In various exemplary embodiments, not all nodes of the ring structure need to be 
provisioned with service capacity. For each node, the capacity table 760 can have entries 
indicating the particular provisioning of the node. For example, if the eastern service 
interface 780 is not provisioned with service capacity, an entry that this service interface 
is unprovisioned can be made in the capacity table 760. It should be appreciated that the 
information in the capacity table 760 relating to the provisionment of the node can be 
changed as decisions are made to add/delete service capacity. 

Generally, as far as the ring-switching protocols are concerned, the unprovisioned 
service channels can act as if they are in a normal state, without any signal fail or signal 
degrade conditions. Thus, standing alarms are not generated for spans that have only 
protection capacity. Traffic can be routed on the provisioned service channels and extra 
traffic for pre-emptable demands can still be routed on the protection channels even when 
there is no corresponding service channel. Additionally, on spans that do not have the 
service channel provisioned, the data communication channel (DCC) will be routed on 
the protection channel. Furthermore, messages sent along a path to verify the integrity of 
the path (e.g., k-byte messages) will be sent on the protection links. 

In a first mode of operation, data continuously flows into the node 700 through 
the various interfaces 770, 775, 780 and 785. The fabric 730 can drop any portions of 
this tributary data that are to be dropped at the node and output this data via the local 
interface 750 to external devices (not shown) via link 752. In addition, the fabric 730 can 
add data received through the local interface 750. This added data would be sent by the 
fabric 730 to the appropriate interface to be routed to the "next node" in the ring structure. 
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While the exemplary fabric 730 is essentially an optical device, it should be appreciated 
that the fabric 730 can be a purely electrical switching device accommodated by 
electrical-to-optical and optical-to-electrical transducers or the fabric 730 can be any 
combination of optical and electrical technologies without departing from the spirit and 
scope of the invention. 

Ordinarily, the received information would be directed to the next node of the ring 
adjacent to the present node. For example, if data arrives via link 771 of the western 
service interface 770 5 this data would ordinarily be transmitted via link 781 of the eastern 
service interface 780 to the node connected to this link 781 (unless, of course, it is to be 
dropped at the present node). If the traffic patterns require a service interface that is not 
currently provisioned, then a provisioning operation is required to create the appropriate 
service interface and keep track of it in the capacity table 760. 

The controller 710 interacts with the fault detection unit 740 to determine whether 
any fault condition exists between the present node and the next node. The types of error 
conditions that can be measured by the fault detection unit 740 include single bit errors, 
octet errors, cyclic redundancy check (CRC) errors, checksum errors, framing errors and 
loss of signal errors. However, as communication standards evolve and new standards 
develop, it should be appreciated that any error condition or failure capable of being 
measured can be used without departing from the spirit and scope of the present 
invention. 

If the fault detection unit 740 detects a fault, the controller 710 can suspend the 
transmission of certain non-critical types of data, such as non-critical communications 
that may be retransmitted if lost, network status messages, and best-effort type of 
communications. However, if the type of data to be transmitted is not pre-emptable, then 
the controller 710 proceeds with linkage restoration. If the fault condition indicates that a 
service fiber is not usable but the corresponding protection fiber is available, then 
controller 710 directs transmittal of the information to the next node of the ring but along 
the protection link instead of the service link. In the event that the fault detection unit 
740 determines that there is a complete span failure, controller 710 directs the 
information to the prior node of the ring in the opposite direction along the protection 
link. 
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Fig. 8 depicts a flowchart outlining an exemplary embodiment of a technique for 
providing a fault-tolerant ring structure. Beginning at step 810, a transmission request is 
received indicating that information is to be transmitted to another node. 

In step 820, a determination is made as to whether a service fiber is available to 

5 transmit the information. In order to determine whether service is available, a capacity 
table can be queried to determine whether the service capacity is provided. Assuming 
that service capacity is provided, the node can then check for a service fault condition. If 
service is provisioned and no fault has been detected (i.e., service is available), control 
continues to step 830; otherwise, control continues to step 840. 

10 In step 830, the service mode is activated. Once the service mode is activated, 

any information that is thereafter transmitted is on the service fiber. Control continues to 
step 860. Next, in step 860, the information can be transmitted along the service fiber to 
the next node of the ring structure. 

However, if service is not available, then the protection mode is activated in step 

1 5 840. Next, in step 850, a determination is made as to whether there is a complete span 
failure (e.g., a fiber cut, a node failure). If there is not a complete span failure, then 
control passes to step 860 where the information is transmitted along the protection fiber 
to the next node of the ring. However, if there is a complete span failure, then control 
passes to step 870 where a ring loopback function operates to transmit the information 

20 along the protection fiber to the prior node of the ring. Control continues to step 880 
where the operation stops. 

While this invention has been described in conjunction with the exemplary 
embodiments outlined above, it is evident that many alternatives, modifications and 
variations of the invention, as set forth above, are intended to be illustrative, and not 

25 limiting. Various changes may be made without departing from the spirit and scope of 
the invention. 



